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Abstract. It is well-known that a random variable, i.e. a function de- 
fined on a probability space, with values in a Borel space, can be rep- 
resented on the special probability space consisting of the unit interval 
with Lebesgue measure. We show an extension of this to multivariate 
functions. This is motivated by some recent constructions of random 
graphs. 



A fundamental feature of probability theory is that random variables are 
defined as functions on a probability space (ft, J 7 , P), but the actual proba- 
bility space has very little importance, and often no importance at all. For 
example, we have the following well-known basic result, showing that as long 
as we consider a single random variable, we may assume that the probability 
space is ([0, 1],B, dx), the unit interval with the Borel <r-field and Lebesgue 
measure. 

Theorem 1. If f : (f2, T, P) — > M is any random variable, then there exists 
a random variable f : ([0, 1], B, dx) — » R such that f and f have the same 
distribution. 

A standard construction of / is to take the right-continuous inverse of the 
distribution function P(/ < x), f(x) := inf{y : P(/ < y) > x}. 

Moreover, it is also well-known that Theorem [T] extends to vector- valued 
random variables and random variables with values in more general spaces. 
To state a precise result, we recall that a Borel space 0, Appendix Al], also 
called Lusin space 0, III. 16, 111.20(b)], is a measurable space that is isomor- 
phic to a Borel subset of [0,1]. Every Polish space (a complete separable 
metric space) with its Borel cr-field is a Borel space @, Appendix Al], 
III. 17]. (For example, this includes R n , which gives the extension to vector 
valued random variables.) 

Theorem 2. If f : {VI, J- , P) — > S is any random variable with values in 
a Borel space S, then there exists a random variable f : ([0, 1], B, dx) — > S 
such that f and f have the same distribution. 

Proof. This is an almost trivial extension of Theorem [TJ By assumption, 
there exists an isomorphism ip : S — > E, where E C [0, 1] is a Borel set. This 
means that (p is a bijection, and that ip and tp~ l : E — > S are measurable. 
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Extend ip 1 to R by denning tp = so when x £ E for some arbitrary 
s G <S. Then, g := <p o f : (f2, J 7 , P) £ C 1 is a random variable, and 

Theorem [T] gives : ([0, 1], B, dx) — > R with g = g. Let / := ip^ 1 o g : 
([0,l],2?,cfc) ->5. □ 

Remark 3. In fact, every uncountable Borel space T is isomorphic to [0,1] 
[l, Appendix III. 80]. If \i is any continuous probability measure (i.e., a 
probability measure such that every point has measure 0) on a Borel space 
T, then T necessarily is uncountable, and thus there is an isomorphism 
a : T — > [0, 1] which maps // to some continuous Borel measure v on [0,1], and 
hence a is an isomorphism (T,/i) — > ([0, 1],^). Further, if we let F u (x) := 
i/[0, x], then F u : [0, 1] — > [0, 1] is a continuous function that maps v to the 
Lebesgue measure dx. Hence, in Theorem [Q / o F u o a : (T, //) — > R has 

f o F v o a = / = /, and similarly for Theorem [2l Consequently, we can 
replace ([0, 1], B, dx) by (T,fj,) in Theorems Q] and El for any Borel space T 
with a continuous probability measure \x. 

The purpose of this note is to give an elementary proof of an extension 
of Theorem Q] to functions / : f2 m — * R or / : £l m — > S of several variables, 
where m > 1. This is a folklore result, but we do not know any reference, 
so we provide a detailed statement and proof. Our interest in this problem 
comes from the following example, with a construction of random graphs 
used by, for example, Lovasz and Szegedy [H| and Bollobas, Janson and 
Riordan 0] (in different contexts), see further Borgs, Chayes, Lovasz, Sos 
and Vesztergombi [1, 0] ; Diaconis and Janson 0] and further references in 
these papers, and an extension of this example studied in Bollobas, Janson 
and Riordan [3|, where also functions / : Q, m — > [0, 1] with m > 2 are used. 
We use the notation [re] := {1, . . . , re} if n < oo and [oo] := N := {1, 2, . . . }. 

Example 4. Let / : — > [0, 1] be a symmetric measurable function and 
let 1 < re < oo. Define a random graph on the vertex set [re] by first taking 
re i.i.d. random variables (Xi)™ in £1 with distribution P, and then letting, 
conditioned on these random variables, the edges ij with i < j appear 
independently, with the probability of an edge ij equal to f(Xi,Xj). 

If we choose another probability space (Cl,J-, P) and a symmetric func- 
tion / : Cl 2 — > [0, 1], we obtain the same distribution of the corresponding 
two random graphs if the joint distribution of the families (/(-Xj,Xj))j<j 
and (f(Xi,Xj))i<j are equal, where Xj has the distribution P on This 
motivates the following definition, where ©oo denotes the set of all permu- 
tations of N. (For notational convenience we write f(X\, . . . , X m ) also when 
m = oo; this should be interpreted as f(Xi,X2, •••)•) 

Definition. Let (O, J 7 , P) be a probability space and let / a : f2 m ° — > S* a , 
a € A, be a collection of measurable functions, where A is any index set, 
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m a G N* := N U {00} are positive integers or 00, and S a are measur- 
able spaces. We say that the family (f a )a can be represented on a prob- 
ability space (Cl, J 7 , P) if there exists a collection of measurable functions 
f a : Q. ma — * S a , a £ A, such that if X±,X2, ■ ■ ■ are i.i.d. random elements 
of Q with distribution P, and if Xi, X2, ■ ■ ■ are i.i.d. random elements of 
Cl with distribution P, then the collections \fa(X a fu, . . . ,X a i m \) : a G 
A, a G 600} and {f a (X a ^,. . . ,X ct(toc() ) : a G A, a G ©oo} have the same 
distribution. 

Equivalently, if we regard each f a as being defined on Q°° by ignoring all 
but the m a first coordinates, and let Goo act on $7°° in the natural way, then 
the collections {f a cr : a £ A, <tG 600} and |/ a o a : a G A, a G ©00} 
of random variables defined of O 00 and respectively, have the same 
distribution. 

If we have only a single function / : £l m — > 5, it is further equivalent that 
the collections /(X^, . . . , Xi m ) and /(X^, . . . , Xf m ), indexed by sequences 
(ii, . . . , i m ) of distinct integers in N, have the same distribution. In principle, 
the definition could be stated in this way for families {/«} and {f a } too, 
with in general varying (and possibly unbounded) m a , but we leave it to 
the reader to find a nice formulation in this form. 

Remark 5. We may equip Q ma either with the product a-field !F ma or 
with its completion. This does not matter, because if f a is measurable with 
respect to the completed cr-field, it can be modified on a set of (product) 
measure such that it becomes measurable with respect to J Tnia , and this 
does not change any of the distributions above. Similarly, we may assume 
that f a is measurable with respect to J-™"*. 

Remark 6. As is well-known, the representing functions f a are in general 



far from unique. See, for example, [lfl, Theorem 7.28] for a general result 
on equivalence of representations. 

The main result is that every countable collection of functions on any 
probability space, with values in a Borel space, can be represented on 
([0,l},B,dx). 

Theorem 7. Let /j : f2 mi — > Si, i = 1, 2, . . . , be a finite or countable family 
of measurable functions, where rrii G N* and Si are Borel spaces. Then {fi)i 
can be represented on ([0, 1], B, dx). 

Moreover, if every /j is symmetric, then we can choose the representing 
functions on ([0, 1],B, dx) to be symmetric too. 

Proof. First, by the argument in the proof of Theorem [2j it suffices to con- 
sider the case S% = M. for all i. Further, we may by Remark [5] assume that 
each fi is measurable with respect to J rma , without completion. 

Let A := {C C T : \C\ < Ho} be the collection of all countable families 
of measurable sets in $7. Since a countable union |J ■ Cj of families Cj G A 
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also belongs to A, it is easily seen that Uce.4 cr (^) m ^ s a afield on Q m , for 
every m < oo; since further, by the definition of product cr-fields, every set 
in J 7 " 1 belongs to a sub-u-field generated by a countable number of cylinder 
sets, and thus to some cr(C) m , it follows that T m = \JceA a {C) m - Moreover, 
every /j is measurable with respect to the cr-field generated by the countable 
family {a/ E Q. ma : fi(u>') < r} , r £ Q, and each of these thus belongs to the 
product cr-field J 7 ^." defined by some countably generated sub-cr-field J 7 ^ 
of T. Consequently, there exists a countably generated sub-cr-field T§ of T 
such that each /j is J^ li measurable. 

Next, let Ai,A2, ■ ■ ■ be a sequence of subsets of £1 that generate Tq. Let 
h : n -> £> := {0, 1}°° be defined by /i(x) = £ (2? is, topologically, 

the Cantor set.) Then J-q equals the cr-field generated on f2 by h, and thus 
equals the cr-field on O m generated by h m := (h, . . . , h) : VL m -> V m . 
Since fi is J-"™ l -measurable, it follows that /j = gioh mi for some measurable 
function & : £> m * R. If we let O = V, let be the Borel cr-field on V, 
and let P be the probability measure induced by h, it follows immediately 
that the family (<?j)j gives a representation of (fj)i on (T>,P). 

Applying Theorem [2] to the identity function i : (T>,P) — * (D,P), we 
see that there exists a function I : [0, 1] — > T> mapping the Lebesgue mea- 
sure dx to P. Consequently, the mappings gi o r 11 ^ — > R represent on 
([0,l],B,dx). 

It is clear that #j is symmetric if every /j is. □ 

The symmetry statement in Theorem[7]can be extended to include partial 
symmetries, antisymmetries, . . . ; we leave this to the reader. 

Remark 8. Theorem [7] is related to the th eory for exchangeable arrays 
by Aldous [l| and Hoover [g], see Kallenberg [id . Chapter 7] for a detailed 
exposition. Consider for simplicity a single function / : J7 2 — > S. It is 
obvious that the random variables f(Xi,Xj), i ^ j. form an exchangeable 
family. Conversely, the theorem by Aldous-Hoover [10|, Theorem 7.22] says 
that every exchangeable family can be represented on ([0, l],B,dx), but in 
the more general form g(X$,Xi,Xj,Xij). It seems likely that a further study 
of the representations can lead to a proof that we may in this case take g 
independent of X$ and Xij, which would yield Theorem[7]in this case. (And 
presumably this could be extended to the general case.) Nevertheless, such 
a proof would be highly technical, and hardly shorter than the elementary 
proof given above, so we have not pursued this. 
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